Mathematical Foundations for a Compositional Distributional Model of Meaning
نویسندگان
چکیده
We propose a mathematical framework for a unification of the distributional theory of meaning in terms of vector space models, and a compositional theory for grammatical types, for which we rely on the algebra of Pregroups, introduced by Lambek. This mathematical framework enables us to compute the meaning of a well-typed sentence from the meanings of its constituents. Concretely, the type reductions of Pregroups are ‘lifted’ to morphisms in a category, a procedure that transforms meanings of constituents into a meaning of the (well-typed) whole. Importantly, meanings of whole sentences live in a single space, independent of the grammatical structure of the sentence. Hence the inner-product can be used to compare meanings of arbitrary sentences, as it is for comparing the meanings of words in the distributional model. The mathematical structure we employ admits a purely diagrammatic calculus which exposes how the information flows between the words in a sentence in order to make up the meaning of the whole sentence. A variation of our ‘categorical model’ which involves constraining the scalars of the vector spaces to the semiring of Booleans results in a Montague-style Boolean-valued semantics.
منابع مشابه
Categorical Foundations for Extended Compositional Distributional Models of Meaning
Compositional distributional models of meaning were introduced by Coecke et al. (2010, 2013) with the aim of reconciling the theory of distributional meaning in terms of vector space semantics with the theory of compositional interpretation as one finds it in typelogical grammars. The particular typelogical formalisms employed by Coecke et al. (pregroup grammars, Lambek calculus) have a recogni...
متن کاملCompositional-ly Derived Representations of Morphologically Complex Words in Distributional Semantics
Speakers of a language can construct an unlimited number of new words through morphological derivation. This is a major cause of data sparseness for corpus-based approaches to lexical semantics, such as distributional semantic models of word meaning. We adapt compositional methods originally developed for phrases to the task of deriving the distributional meaning of morphologically complex word...
متن کاملA Compositional Distributional Semantics, Two Concrete Constructions, and Some Experimental Evaluations
We provide an overview of the hybrid compositional distributional model of meaning, developed in [6], which is based on the categorical methods also applied to the analysis of information flow in quantum protocols. The mathematical setting stipulates that the meaning of a sentence is a linear function of the tensor products of the meanings of its words. We provide concrete constructions for thi...
متن کاملNon-commutative Logic for Compositional Distributional Semantics
Distributional models of natural language use vectors to provide a contextual foundation for meaning representation. These models rely on large quantities of real data, such as corpora of documents, and have found applications in natural language tasks, such as word similarity, disambiguation, indexing, and search. Compositional distributional models extend the distributional ones from words to...
متن کاملComparing Meaning in Language and Cognition: P-Hyponymy, Concept Combination, Asymmetric Similarity
In this dissertation we work in the framework of compositional distributional models of meaning to examine a number of asymmetric linguistic phenomena that manifest themselves in language and cognition. These include overextension with respect to concept combination, asymmetry of similarity judgment and hyponymy and typicality. In particular, we make use of the formalism of density matrices, wh...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1003.4394 شماره
صفحات -
تاریخ انتشار 2010